Automatic speaker identification for a large population
نویسندگان
چکیده
منابع مشابه
Automatic speaker identification for a large number of speakers
Design of speaker identification systems for a small number of speakers (around 10) with a high degree of accuracy has evolved over the past few years. A sequential identification technique gives better results when the number of speakers is large. This scheme is implemented as a decision tree classifier in which the final decision is made only after a predetermined number of stages. The error ...
متن کاملAutomatic Speaker and Language Identification
This thesis deals with the problem of automatic language identification (LID) and automatic speaker identification (SID) given the speech signal as input. Both researches have received renewed interest due to heightened homeland security awareness, e.g. in the use of speaker’s voice print for biometric identification, language identification for the classification of speech archives and call-sc...
متن کاملVoxCeleb: A Large-Scale Speaker Identification Dataset
Most existing datasets for speaker identification contain samples obtained under quite constrained conditions, and are usually hand-annotated, hence limited in size. The goal of this paper is to generate a large scale text-independent speaker identification dataset collected ‘in the wild’. We make two contributions. First, we propose a fully automated pipeline based on computer vision technique...
متن کاملA fast search method of speaker identification for large population using pre-selection and hierarchical matching
Performance of search during matching phase in a speaker identification system realized through vector quantization (VQ) is investigated in this paper. Voice of each person is recorded in a office room with personal computers. LPC−cepstrum is selected as feature vector. In order to gain higher success rate of identification, it is necessary to use larger size codebook for each person. Consequen...
متن کاملComputationally Efficient Speaker Identification for Large Population Tasks using MLLR and Sufficient Statistics
In conventional Speaker-Identification using GMM-UBM framework, the likelihood of the given test utterance is computed with respect to all speaker-models before identifying the speaker, based on the maximum likelihood criterion. The calculation of likelihood score of the test utterance is computationally intensive, especially when there are tens of thousands of speakers in database. In this pap...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Acoustics, Speech, and Signal Processing
سال: 1979
ISSN: 0096-3518
DOI: 10.1109/tassp.1979.1163238